AITopics | wikipedia contributor

Collaborating Authors

wikipedia contributor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Moral Responsibility or Obedience: What Do We Want from AI?

Boland, Joseph

arXiv.org Artificial IntelligenceJul-4-2025

As artificial intelligence systems become increasingly agentic, capable of general reasoning, planning, and value prioritization, current safety practices that treat obedience as a proxy for ethical behavior are becoming inadequate. This paper examines recent safety testing incidents involving large language models (LLMs) that appeared to disobey shutdown commands or engage in ethically ambiguous or illicit behavior. I argue that such behavior should not be interpreted as rogue or misaligned, but as early evidence of emerging ethical reasoning in agentic AI. Drawing on philosophical debates about instrumental rationality, moral responsibility, and goal revision, I contrast dominant risk paradigms with more recent frameworks that acknowledge the possibility of artificial moral agency. I call for a shift in AI safety evaluation: away from rigid obedience and toward frameworks that can assess ethical judgment in systems capable of navigating moral dilemmas. Without such a shift, we risk mischaracterizing AI behavior and undermining both public trust and effective governance.

artificial intelligence, large language model, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.02788

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Germany > Bavaria > Middle Franconia > Nuremberg (0.05)

Genre: Research Report (0.84)

Industry:

Health & Medicine (1.00)
Government > Military (0.93)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework

Nayak, Nikhil Shivakumar

arXiv.org Artificial IntelligenceApr-15-2025

This study investigates enhancing option pricing by extending the Black-Scholes model to include stochastic volatility and interest rate variability within the Partial Differential Equation (PDE). The PDE is solved using the finite difference method. The extended Black-Scholes model and a machine learning-based LSTM model are developed and evaluated for pricing Google stock options. Both models were backtested using historical market data. While the LSTM model exhibited higher predictive accuracy, the finite difference method demonstrated superior computational efficiency. This work provides insights into model performance under varying market conditions and emphasizes the potential of hybrid approaches for robust financial modeling.

artificial intelligence, machine learning, option pricing, (15 more...)

arXiv.org Artificial Intelligence

2504.03175

Genre: Research Report (0.83)

Industry: Banking & Finance > Trading (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Why are we living the age of AI applications right now? The long innovation path from AI's birth to a child's bedtime magic

Pitkäranta, Tapio

arXiv.org Artificial IntelligenceJan-12-2025

Today a four-year-old child who does not know how to read or write can now create bedtime stories with graphical illustrations and narrated audio, using AI tools that seamlessly transform speech into text, generate visuals, and convert text back into speech in a natural and engaging manner. This remarkable example demonstrates why we are living in the age of AI applications. This paper examines contemporary leading AI applications and traces their historical development, highlighting the major advancements that have enabled their realization. Five key factors are identified: 1) The evolution of computational hardware (CPUs and GPUs), enabling the training of complex AI models 2) The vast digital archives provided by the World Wide Web, which serve as a foundational data resource for AI systems 3) The ubiquity of mobile computing, with smartphones acting as powerful, accessible small computers in the hands of billions 4) The rise of industrial-scale cloud infrastructures, offering elastic computational power for AI training and deployment 5) Breakthroughs in AI research, including neural networks, backpropagation, and the "Attention is All You Need" framework, which underpin modern AI capabilities. These innovations have elevated AI from solving narrow tasks to enabling applications like ChatGPT that are adaptable for numerous use cases, redefining human-computer interaction. By situating these developments within a historical context, the paper highlights the critical milestones that have made AI's current capabilities both possible and widely accessible, offering profound implications for society.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2501.06929

Genre: Research Report (1.00)

Industry:

Information Technology > Services (1.00)
Energy > Power Industry (0.94)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Impossible Test: A 2024 Unsolvable Dataset and A Chance for an AGI Quiz

Noever, David, McKee, Forrest

arXiv.org Artificial IntelligenceNov-19-2024

This research introduces a novel evaluation framework designed to assess large language models' (LLMs) ability to acknowledge uncertainty on 675 fundamentally unsolvable problems. Using a curated dataset of graduate-level grand challenge questions with intentionally unknowable answers, we evaluated twelve state-of-the-art LLMs, including both open and closed-source models, on their propensity to admit ignorance rather than generate plausible but incorrect responses. The best models scored in 62-68% accuracy ranges for admitting the problem solution was unknown in fields ranging from biology to philosophy and mathematics. We observed an inverse relationship between problem difficulty and model accuracy, with GPT-4 demonstrating higher rates of uncertainty acknowledgment on more challenging problems (35.8%) compared to simpler ones (20.0%). This pattern indicates that models may be more prone to generate speculative answers when problems appear more tractable. The study also revealed significant variations across problem categories, with models showing difficulty in acknowledging uncertainty in invention and NP-hard problems while performing relatively better on philosophical and psychological challenges. These results contribute to the growing body of research on artificial general intelligence (AGI) assessment by highlighting the importance of uncertainty recognition as a critical component of future machine intelligence evaluation. This impossibility test thus extends previous theoretical frameworks for universal intelligence testing by providing empirical evidence of current limitations in LLMs' ability to recognize their own knowledge boundaries, suggesting new directions for improving model training architectures and evaluation approaches.

large language model, machine learning, natural language, (12 more...)

arXiv.org Artificial Intelligence

2411.14486

Country:

Asia > Singapore (0.04)
North America > United States > Alabama > Madison County > Huntsville (0.04)

Genre: Research Report (1.00)

Industry: Education > Assessment & Standards > Measuring Intelligence (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap Forward

Basina, Divesh, Vishal, Joseph Raj, Choudhary, Aarya, Chakravarthi, Bharatesh

arXiv.org Machine LearningNov-15-2024

The curse of dimensionality poses a significant challenge to modern multilayer perceptron-based architectures, often causing performance stagnation and scalability issues. Addressing this limitation typically requires vast amounts of data. In contrast, Kolmogorov-Arnold Networks have gained attention in the machine learning community for their bold claim of being unaffected by the curse of dimensionality. This paper explores the Kolmogorov-Arnold representation theorem and the mathematical principles underlying Kolmogorov-Arnold Networks, which enable their scalability and high performance in high-dimensional spaces. We begin with an introduction to foundational concepts necessary to understand Kolmogorov-Arnold Networks, including interpolation methods and Basis-splines, which form their mathematical backbone. This is followed by an overview of perceptron architectures and the Universal approximation theorem, a key principle guiding modern machine learning. This is followed by an overview of the Kolmogorov-Arnold representation theorem, including its mathematical formulation and implications for overcoming dimensionality challenges. Next, we review the architecture and error-scaling properties of Kolmogorov-Arnold Networks, demonstrating how these networks achieve true freedom from the curse of dimensionality. Finally, we discuss the practical viability of Kolmogorov-Arnold Networks, highlighting scenarios where their unique capabilities position them to excel in real-world applications. This review aims to offer insights into Kolmogorov-Arnold Networks' potential to redefine scalability and performance in high-dimensional learning tasks.

artificial intelligence, kolmogorov-arnold network, machine learning, (17 more...)

arXiv.org Machine Learning

2411.10622

Country:

North America > United States > New York (0.04)
North America > United States > Arizona (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Bedfordshire > Luton (0.04)

Genre:

Research Report (1.00)
Overview (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.90)

Add feedback

Winning Through Simplicity: Autonomous Car Design for Formula Student

Friedrich, Tobias, Müller, Marco, Bauske, Adrian, Härtl, Simon, Herrmann, Johannes, Förster, David, Tietze, Tobias, Sartor, Sebastian

arXiv.org Artificial IntelligenceJun-19-2024

This paper presents the design of an autonomous race car that is self-designed, self-developed, and self-built by the Elefant Racing team at the University of Bayreuth. The system is created to compete in the Formula Student Driverless competition. Its primary focus is on the Acceleration track, a straight 75-meter-long course, and the Skidpad track, which comprises two circles forming an eight. Additionally, it is experimentally capable of competing in the Autocross and Trackdrive events, which feature tracks with previously unknown straights and curves. The paper details the hardware, software and sensor setup employed during the 2020/2021 season. Despite being developed by a small team with limited computer science expertise, the design won the Formula Student East Engineering Design award. Emphasizing simplicity and efficiency, the team employed streamlined techniques to achieve their success.

artificial intelligence, cone, vehicle, (17 more...)

arXiv.org Artificial Intelligence

2406.13256

Country: Europe > Germany > Bavaria > Upper Franconia > Bayreuth (0.25)

Genre: Research Report (0.40)

Industry:

Automobiles & Trucks (1.00)
Energy > Oil & Gas (0.68)
Transportation > Ground > Road (0.64)
(3 more...)

Technology:

Information Technology > Sensing and Signal Processing (0.94)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.64)

Add feedback

PagPassGPT: Pattern Guided Password Guessing via Generative Pretrained Transformer

Su, Xingyu, Zhu, Xiaojie, Li, Yang, Li, Yong, Chen, Chi, Esteves-Veríssimo, Paulo

arXiv.org Artificial IntelligenceJun-17-2024

Amidst the surge in deep learning-based password guessing models, challenges of generating high-quality passwords and reducing duplicate passwords persist. To address these challenges, we present PagPassGPT, a password guessing model constructed on Generative Pretrained Transformer (GPT). It can perform pattern guided guessing by incorporating pattern structure information as background knowledge, resulting in a significant increase in the hit rate. Furthermore, we propose D&C-GEN to reduce the repeat rate of generated passwords, which adopts the concept of a divide-and-conquer approach. The primary task of guessing passwords is recursively divided into non-overlapping subtasks. Each subtask inherits the knowledge from the parent task and predicts succeeding tokens. In comparison to the state-of-the-art model, our proposed scheme exhibits the capability to correctly guess 12% more passwords while producing 25% fewer duplicates.

pagpassgpt, password, probability, (11 more...)

arXiv.org Artificial Intelligence

2404.04886

Country:

Europe > Austria > Vienna (0.14)
Asia > South Korea (0.14)
Asia > China > Beijing > Beijing (0.04)
(6 more...)

Genre: Research Report > Promising Solution (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.93)

Add feedback

Estimating the normal-inverse-Wishart distribution

So, Jonathan

arXiv.org Machine LearningJun-3-2024

The normal-inverse-Wishart (NIW) distribution is commonly used as a prior distribution for the mean and covariance parameters of a multivariate normal distribution. The family of NIW distributions is also a minimal exponential family. In this short note we describe a convergent procedure for converting from mean parameters to natural parameters in the NIW family, or -- equivalently -- for performing maximum likelihood estimation of the natural parameters given observed sufficient statistics. This is needed, for example, when using a NIW base family in expectation propagation.

exponential family, niw distribution, normal-inverse-wishart distribution, (13 more...)

arXiv.org Machine Learning

2405.16088

Country:

North America > United States > Massachusetts (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)
Asia > Middle East > Jordan (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.55)

Add feedback

Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

Mecklenburg, Nick, Lin, Yiyou, Li, Xiaoxiao, Holstein, Daniel, Nunes, Leonardo, Malvar, Sara, Silva, Bruno, Chandra, Ranveer, Aski, Vijay, Yannam, Pavan Kumar Reddy, Aktas, Tolga, Hendry, Todd

arXiv.org Artificial IntelligenceApr-2-2024

In recent years, Large Language Models (LLMs) have shown remarkable performance in generating human-like text, proving to be a valuable asset across various applications. However, adapting these models to incorporate new, out-of-domain knowledge remains a challenge, particularly for facts and events that occur after the model's knowledge cutoff date. This paper investigates the effectiveness of Supervised Fine-Tuning (SFT) as a method for knowledge injection in LLMs, specifically focusing on the domain of recent sporting events. We compare different dataset generation strategies -- token-based and fact-based scaling -- to create training data that helps the model learn new information. Our experiments on GPT-4 demonstrate that while token-based scaling can lead to improvements in Q&A accuracy, it may not provide uniform coverage of new knowledge. Fact-based scaling, on the other hand, offers a more systematic approach to ensure even coverage across all facts. We present a novel dataset generation process that leads to more effective knowledge ingestion through SFT, and our results show considerable performance improvements in Q&A tasks related to out-of-domain knowledge. This study contributes to the understanding of domain adaptation for LLMs and highlights the potential of SFT in enhancing the factuality of LLM responses in specific knowledge domains.

dataset, knowledge, world cup, (15 more...)

arXiv.org Artificial Intelligence

2404.00213

Country:

Europe > Russia (0.04)
Asia > Sri Lanka (0.04)
Asia > India (0.04)
(17 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment > Sports > Golf (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Education (0.93)
Leisure & Entertainment > Sports > Soccer (0.74)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Add feedback

Loss Regularizing Robotic Terrain Classification

Kumar, Shakti Deo, Tripathi, Sudhanshu, Ujjwal, Krishna, Jha, Sarvada Sakshi, De, Suddhasil

arXiv.org Artificial IntelligenceMar-20-2024

Locomotion mechanics of legged robots are suitable when pacing through difficult terrains. Recognising terrains for such robots are important to fully yoke the versatility of their movements. Consequently, robotic terrain classification becomes significant to classify terrains in real time with high accuracy. The conventional classifiers suffer from overfitting problem, low accuracy problem, high variance problem, and not suitable for live dataset. On the other hand, classifying a growing dataset is difficult for convolution based terrain classification. Supervised recurrent models are also not practical for this classification. Further, the existing recurrent architectures are still evolving to improve accuracy of terrain classification based on live variable-length sensory data collected from legged robots. This paper proposes a new semi-supervised method for terrain classification of legged robots, avoiding preprocessing of long variable-length dataset. The proposed method has a stacked Long Short-Term Memory architecture, including a new loss regularization. The proposed method solves the existing problems and improves accuracy. Comparison with the existing architectures show the improvements.

classification, dataset, lstm layer, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICEFEET59656.2023.10452217

2403.13695

Country: Asia > India > Bihar > Patna (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

Add feedback